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between replicons. Transposons are structurally variable, being described as simple or compound, but typically encode 
the recombinase gene flanked by DNA sequences organized in inverted orientations. Integration of transposons can 
be random or liigfily specific. Representatives such as Tn7, which are highly site-specific, have been applied to the in 
wVo movement of DNA segments between replicons (Lucklow et al., J. Virol. 67:4566-4579 (1993)). 

5 [001 5] Devlne and Boeke Nucl. Acids Res. 22:3765-3772 {1 994), discloses the construction of artificial transposons 
for the insertion of DNA segments, in vitro, into recipient DNA molecules. The system makes use of the integrase of 
yeast TY1 virus-like particles. The DNA segment of interest is cloned, using standard methods, between the ends of 
the transposon-like element TY1 . In the presence of the TY1 integrase, the resulting element integrates randomly into 
a second target DNA molecule. 

10 [0016] DNA cloning. The cloning of DNA segments currently occurs as a daily routine in many research labs and 
as a prerequisite step in many genetic analyses. The purpose of these clonings is various, however, two general 
purposes can be considered: (1) the initial cloning of DNA from large DNA or RNA segments (chromosomes, YACs, 
PGR fragments, mRNA, etc.), done in a relative handful of known vectors such as pUC, pGem, pBlueScript, and (2) 
the subcloning of these DNA segments into specialized vectors for functional analysis. A great deal of time and effort 

15 is expended both in the initial cloning of DNA segments and in the transfer of DNA segments from the initial cloning 
vectors to the more specialized vectors. This transfer is called subcloning. 

[0017] The basic methods for cloning have been known for many years and have changed little during that time. A 
typical cloning protocol is as follows: 

20 (1 ) digest the DNA of interest with one or two restriction enzymes; 



(2) gel purify the DNA segment of interest when known; 

(3) prepare the vector by cutting with appropriate restriction enzymes, treating with alkaline phosphatase, gel purify 
etc., as appropriate; 

(4) ligate the DNA segment to vector, with appropriate controls to estimate background of uncut and self-ligated 



(5) introduce the resulting vector into an E. coli host cell; 

(6) pick selected colonies and grow small cultures overnight; 

(7) make DNA minipreps; and 

(8) analyze the isolated plasmid on agarose gels (often after diagnostic restriction enzyme digestions) or by PGR. 



[0018] The specialized vectors used for subcloning DNA segments are functionally diverse. These include but are 
not limited to: vectors for expressing genes in various organisms; for regulating gene expression; for providing tags to 
aid in protein purification or to allow tracking of proteins in cells; for modifying the cloned DNA segment {e.g., generating 
deletions); for the synthesis of probes {e.g., riboprobes); for the preparation of templates for DNA sequencing; for the 
identification of protein coding regions; for the fusion of various protein-coding regions; to provide large amounts of 
the DNA of interest, etc. It is common that a particular investigation will involve subcloning the DNA segment of interest 
into several different specialized vectors. 

[0019] As known in the art, simple subclonings can be done in one day (e.g., the DNA segment is not large and the 
restriction sites are compatible with those of the subcloning vector). However, many other subclonings can take several 
weeks, especially those involving unknown sequences, long fragments, toxic genes, unsuitable placement of restriction 
sites, high backgrounds, impure enzymes, etc. Subcloning DNA fragments is thus often viewed as a chore to be done 
as few times as possible. 

[0020] Several methods for facilitating the cloning of DNA segments have been described, e.g., as in the following 
references. 

[0021] Ferguson, J., etal. Gene 16:191 (1981), discloses a family of vectors for subcloning fragments of yeast DNA. 
The vectors encode kanamycin resistance. Glones of longer yeast DNA segments can be partially digested and ligated 
into the subcloning vectors. If the original cloning vector conveys resistance to ampicillin, no purification is necessary 
prior to transformation, since the selection will be for kanamycin. 

[0022] Hashimoto-Gotoh,T., etal. Gene41:125 (1986), discloses a subcloning vector with unique cloning sites within 
a streptomycin sensitivity gene; in a streptomycin-resistant host, only plasmids with inserts or deletions in the dominant 
sensitivity gene will survive streptomycin selection. 

[0023] Accordingly, traditional subcloning methods, using restriction enzymes and ligase, are time consuming and 
relatively unreliable. Considerable labor is expended, and if two or more days later the desired subclone can not be 
found among the candidate plasmids, the entire process must then be repeated with alternative conditions attempted. 
Although site specific recombinases have been used to recombine DNA in vivo, the successful use of such enzymes 
in vitro was expected to suffer from several problems. For example, the site specificities and efficiencies were expected 
to differ In v/iro/topologically-linked products were expected; and the topology of the DNA substrates and recombination 
proteins was expected to differ significantly in vitro (see, e.g., Adams et ai, J. Mol. Biol. 226:661 -73 (1 992)). Reactions 
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constituting a functional Selectable marker when recombined across the first or second recombination site with another 
inactive fragment of at least one Selectable marker. 

[0031] The present recomblnational cloning method possesses several advantages over previous in vivo methods. 
Since single molecules of recombination products can be introduced into a biological host, propagation of the desired 
Product DNA in the absence of other DNA molecules (e.g., starting molecules, intermediates, and by-products) is more 
readily realized. Reaction conditions can be freely adjusted in vitro to optimize enzyme activities DNA molecules can 
be incompatible with the desired biological host {e.g., YACs, genomic DNA, etc.), can be used. Recombination proteins 
from diverse sources can be employed, together or sequentially. 

Brief Description of ttie Figures 

[0032] Figure 1 depicts one general method of the present invention, wherein the starting (parent) DNA molecules 
can be circular or linear. The goal is to exchange the new subcloning vector D for the original cloning vector B. It is 
desirable in one embodiment to select for AD and against all the other molecules, including the Cointegrate. The square 
and circle are sites of recombination: e.g., /oxP sites, aff sites, etc. For example, segment D can contain expression 
signals, new drug markers, new origins of replication, or specialized functions for mapping or sequencing DNA. 
[0033] Figure 2A depicts an in vitro method of recombining an Insert Donor plasmid (here, pEZC705) with a Vector 
Donor plasmid (here, pEZC726), and obtaining Product DNA and Byproduct daughter molecules. The two recombina- 
tion sites are attP and ioxP on the Vector Donor. On one segment defined by these sites is a kanamycin resistance 
gene whose promoter has been replaced by the tefOP operator/promoter from transposon TnlO. SeeSizemore etal., 
Nucl. Acids f=tes. 18{^ 0);2875 (1 990). In the absence of tet repressor protein, E. coli RNA polymerase transcribes the 
kanamycin resistance gene from the tetOR If tet repressor is present, it binds to tetOP and blocks transcription of the 
kanamycin resistance gene. The other segment of pEZC726 has the fef repressor gene expressed by a constitutive 
promoter. Thus cells transfomied by pEZC726 are resistant to chloramphenicol, because of the chloramphenicol acetyl 
transferase gene on the same segment as tetR, but are sensitive to kanamycin. The recombinase-mediated reactions 
result in separation of the tet/=i gene from the regulated kanamycin resistance gene. This separation results in kan- 
amycin resistance in cells receiving only the desired recombination products. The first recombination reaction is driven 
by the addition of the recombinase called Integrase. The second recombination reaction is driven by adding the re- 
combinase Cre to the Cointegrate (here, pEZC7 Cointegrate). 
[0034] Figure 2B depicts a restriction map of pEZC705. 
[0035] Figure 2C depicts a restriction map of pEZC726. 
[0036] Figure 2D depicts a restriction map of pEZC7 Cointegrate. 
[0037] Figure 2E depicts a restriction map of Intprod. 
[0038] Figure 2F depicts a restriction map of Intbypro. 

[0039] Figure 3A depicts an in vitro method of recombining an insert Donor plasmid (here, pEZC602) with a Vector 
Donor plasmid (here, pEZC629), and obtaining Product (here, EZC6prod) and Byproduct (here, EZC6Bypr) daughter 
molecules. The two recombination sites are toxPand loxP 51 1. One seghient of pEZC629 defined by these sites is a 
kanamycin resistance gene whose promoter has been replaced by the fefOP operator/promoter from transposon Tnl 0. 
In the absence of tet repressor protein, E. coli RNA polymerase transcribes the kanamycin resistance gene from the 
tetOP. If tet repressor is present, it binds to tetOP and blocks transcription of the kanamycin resistance gene. The other 
segment of pEZC629 has the tet repressor gene expressed by a constitutive promoter. Thus cells transformed by 
pEZC629 are resistant to chloramphenicol, because of the chloramphenicol acetyl transferase gene on the same seg- 
ment as tetR, but are sensitive to kanamycin. The reactions result in separation of the tetR gene from the regulated 
kanamycin resistance gene. This separation results in kanamycin resistance in cells receiving the desired recombina- 
tion product. The first and the second recombination events are driven by the addition of the same recombinase, Cre. 
[0040] Figure 3B depicts a restriction map of EZC6Bypr. 
[0041] Figure 3C depicts a restriction map of EZC6prod. 
[0042] Figure 3D depicts a restriction map of pEZC602. 
[0043] Figure 3E depicts a restriction map of pEZC629. 
[0044] Figure 3F depicts a restriction map of EZCBcoint. 

[0045] Figure 4A depicts an application of the in vitro method of recomblnational cloning to subclone the chloram- 
phenicol acetyl transferase gene into a vector for expression in eukaryotic cells. The Insert Donor plasmid, pEZC843, 
is comprised of the chloramphenicol acetyl transferase gene of E. coli, cloned between loxP and attB sites such that 
the toxPsite is positioned at the 5'-end of the gene. The Vector Donor plasmid, pEZCI 003, contains the cytomegalovirus 
eukaryotic promoter apposed to a /oxP site. The supercoiled plasmids were combined with lambda Integrase and Cre 
recombinase in vitro. After incubation, competent E. coli cells were transfomied with the recombinational reaction 
solution. Aliquots of transformations were spread on agar plates containing kanamycin to select for the Product mol- 
ecule (here CMVProd). 
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[0082] Insert: is the desired DNA segment (segment A of Figure 1) which one wishes to manipulate by the method 
of the present invention. The insert can have one or more gene^. 

[0083] Insert Donor: is one of the two parental DNA molecules of the present invention which carries the insert. The 
Insert Donor DNA molecule comprises the Insert flanked on both sides with recombination signals. The Insert Donor 
can be linear or circular. In one embodiment of the invention, the Insert Donor is a circular DNA molecule and further 
comprises a cloning vector sequence outside of the recombination signals (see Figure 1). 

[0084] Product: is one or both the desired daughtermoleculescomprising the>» and Dor Sand C sequences which 
are produced after the second recombination event during the recombinational cloning process (see Figure 1). The 
Product contains the DNA which was to be cloned or subcloned. 

[0085] Promoter: is a DNA sequence generally described as the 5'-region of a gene, located proximal to the start 
codon. The transcription of an adjacent DNA segment is initiated at the promoter region. A repressible promoter's rate 
of transcription decreases in response to a repressing agent. An inducible promoter's rate of transcription increases 
in response to an inducing agent. A constitutive promoter's rate of transcription is not specifically regulated, though it 
can vary under the influence of general metabolic conditions. 

[0086] Recognition sequence: Recognition sequences are particular DNA sequences which a protein, DNA, or RNA 
molecule {e.g., restriction endonuclease, a modification methylase, or a recombtnase) recognizes and binds. For ex- 
ample, the recognition sequence for Cre recombinase is /oxP which is a 34 base pair sequence comprised of two 13 
base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See 
Figure 1 of Sauer, B., Current Opinion in Biotechnology 5:521 -527 (1 994). Other examples of recognition sequences 
are the attB, attP, dttL, and attff sequences which are recognized by the recombinase enzyme X tntegrase. attBIs an 
approximately 25 base pair sequence containing two 9 base pair core-type Int binding sites and a 7 base pair overlap 
region. aftPis an approximately 240 base pair sequence containing core-type Int binding sites and arm-type Int binding 
sites as welt as sites for auxiliary proteins IHF, FIS, and Xis. See Landy, Current Opinion in Biotectinology 3:699-707 
(1993). Such sites are also engineered according to the present invention to enhance methods and products. 
[0087] Recombinase: is an enzyme which catalyzes the exchange of DNA segments at specific recombination sites. 
[0088] Recombinational Cloning: is a method described herein, whereby segments of DNA molecules are ex- 
changed, inserted, replaced, substituted or modified, in vitro or in vivo. 

[0089] Recombination proteins: include excisive or integrative proteins, enzymes, co-factors or associated proteins 
that are involved in recombination reactions involving one or more recombination sites. See, Landy (1994), infra. 
[0090] Repression cassette: is a DNA segment that contains a repressor of a Selectable marker present in the sub- 
cloning vector. 

[0091 ] Selectable marker: is a DNA segment that allows one to select for or against a molecule or a cell that contains 
it, often under particular conditions. These markers can encode an activity, such as, but not limited to, production of 
RNA, peptide, or protein, or can provide a binding site for RNA, peptides, proteins, inorganic and organic compounds 
or compositions and the like. Examples of Selectable markers include but are not limited to: (1) DNA segments that 
encode products which provide resistance against otherwise toxic compounds (e.g., antibiotics); (2) DNA segments 
that encode products which are othenwise lacking in the recipient cell (e.g., tRNA genes, auxotrophic markers); (3) 
DNA segments that encode products which suppress the activity of a gene product; (4) DNA segments that encode 
products which can be readily identified (e.g., phenotypic markers such as p-galactosidase, green fluorescent protein 
(GFP), and cell surface proteins); (5) DNA segments that bind products which are otherwise detrimental to cell survival 
and/or function; (6) DNA segments that otherwise inhibit the activity of any of the DNA segments described in Nos. 
1-5 above (e.g., antisense oligonucleotides); (7) DNA segments that bind products that modify a substrate (e.g. re- 
striction endonucleases); (8) DNA segments that can be used to isolate a desired molecule (e.g. specific protein binding 
sites); (9) DNA segments that encode a specific nucleotide sequence which can be otherwise non-functional (e.g., for 
PGR amplification of subpopulations of molecules); and/or (1 0) DNAsegments, which when absent, directly or indirectly 
confer sensitivity to particular compounds. 

[0092] Selection scheme: is any method which allows selection, enrichment, or identification of a desired Product 
or Product(s) from a mixture containing the Insert Donor, Vector Donor, and/or any intennediates, {e.g. a Cointegrate) 
Byproducts. The selection schemes of one preferred embodiment have at least two components that are either linked 
or unlinked during recombinational cloning. One component is a Selectable marker. The other component controls the 
expression in vitro of the Selectable marker, or survival of the ceil harboring the plasmid carrying the Selectable marker 
Generally, this controlling element will be a repressor or inducer of the Selectable marker, but other means forcontrolling 
expression of the Selectable marker can be used. Whether a repressor or activator is used will depend on whether the 
marker is for a positive or negative selection, and the exact arrangement of the various DNA segments, as will be 
readily apparent to those skilled in the art. A preferred requirement is that the selection scheme results in selection of 
or enrichment for only one or more desired Products. As defined herein, to select for a DNA molecule includes (a) 
selecting or enriching for the presence of the desired DNA molecule, and (b) selecting or enriching against the presence 
of DNA molecules that are not the desired DNA molecule. 
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sites or aff sites). Segment A or Dean contain at least one Selection Marl<er, expression signals, origins of replication, 
or specialized functions for detecting, selecting, expressing, mapping or sequencing DNA, where D is used in tiiis 
example. 

[0103] Examples of desired DNA segments that can be part of Element Aor D include, but are not limited to, PGR 
products, large DNA segments, genomic clones or fragments, cDNA clones, functional elements, etc., and genes or 
partial genes, which encode useful nucleic acids or proteins. Moreover, the recombinational cloning of the present 
invention can be used to make ex vivo and in vivo gene transfer vehicles for protein expression and/or gene therapy. 
[0104] In Figure 1 , the scheme provides the desired Product as containing vectors D and A, as follows. The Insert 
Donor (containing A and B) is first recombined at the square recombination sites by recombination proteins, with the 
Vector Donor (containing Cand D), to form a Co-Integrate having each of A-D-C-B. Next, recombination occurs at the 
circle recombination sites to form Product DNA (4 and D) and Byproduct DNA (Cand B). However, if desired, two or 
more different Co-integrates can be formed to generate two or more Products. 

[0105] In one embodiment of the present in vitro or in vivo recombinational cloning method, a method for selecting 
at least one desired Product DNA is provided. This can be understood by consideration of the map of plasmid pEZC726 
depicted In Figure 2. The two exemplary recombination sites are attPand loxR On one segment defined by these sites 
is a kanamycin resistance gene whose promoter has been replaced by the ferOP operator/promoter from transposon 
Tn1 0. In the absence of tet repressor protein, E. coli RNA polymerase transcribes the kanamycin resistance gene from 
the tetOP. If tet repressor Is present, It binds to tetOP and blocks transcription of the kanamycin resistance gene. The 
other segment of pEZC726 has the tet repressor gene expressed by a constitutive promoter. Thus cells transformed 
by pEZC726 are resistant to chloramphenicol, because of the chloramphenicol acetyl transferase gene on the same 
segment as tetR, but are sensitive to kanamycin. The recombination reactions result In separation of the tetR gene 
from the regulated kanamycin resistance gene. This separation results in kanamycin resistance in cells receiving the 
desired recombination Product. 

[01 06] Two different sets of plasmlds were constructed to demonstrate the in vitro method. One set, for use with Cre 
recombinase only (cloning vector 602 and subcloning vector 629 (Figure 3)) contained loxP and loxP 511 sites. A 
second set, for use with Ore and Integrase (cloning vector 705 and subcloning vector 726 (Figure 2)) contained loxP 
and aff sites. The efficiency of production of the desired daughter plasmid was about 60 fold higher using both enzymes 
than using Cre alone. Nineteen of twenty four colonies from the Cre-only reaction contained the desired product, while 
thirty eight of thirty eight colonies from the Integrase plus Cre reaction contained the desired product plasmid. 
[01 07] Other Selection Schemes A variety of selection schemes can be used that are known in the art as they can 
suit a particular purpose for which the recombinational cloning is carried out. Depending upon Individual preferences 
and needs, a number of different types of selection schemes can be used in the recombinational cloning method of 
the present invention. The skilled artisan can take advantage of the availability of the many DNA segments or methods 
for making them and the different methods of selection that are routinely used in the art. Such DNA segments Include 
but are not limited to those which encodes an activity such as, but not limited to, production of RNA, peptide, or protein, 
or providing a binding site for such RNA, peptide, or protein. Examples of DNA molecules used in devising a selection 
scheme are given above, under the definition of "selection scheme" 
[0108] Additional examples include but are not limited to: 

(i) Generation of new primer sites for PGR (e.g., juxtaposition of two DNA sequences that were not previously 
juxtaposed); 

(il) Inclusion of a DNA sequence acted upon by a restriction endonuclease or other DNA modifying enzyme, chem- 
ical, ribozyme, etc.; 

(ill) Inclusion of a DNA sequence recognized by a DNA binding protein, RNA, DNA, chemical, etc.) (e.g., for use 
as an affinity tag for selecting for or excluding from a population) (Davis, Nucl. Acids Res. 24:702-706 (1996); J. 
Wra/. 69.- 8027-8034 (1995)); 

(iv) In vitro selection of RNA ligands for the ribosomal L22 protein associated with Epstein-Barr virus-expressed 
RNA by using randomized and cDNA-derlved RNA libraries; 

(vi) The positioning of functional elements whose activity requires a specific orientation or juxtaposition (e.g., (a) 
a recombination site which reacts poorly In trans, but when placed in cis, in the presence of the appropriate proteins, 
results in recombination that destroys certain populations of molecules; (e.g., reconstitution of a promoter sequence 
that allows in vitro RNA synthesis). The RNA can be used directly, or can be reverse transcribed to obtain the 
desired DNA construct; 

(vii) Selection of the desired product by size (e.g., fractionation) or other physical property of the molecule(s); and 

(viii) Inclusion of a DNA sequence required for a specific modification (e.g., methylation)that allows its identification. 

[0109] After formation of the Product and Byproduct in the method of the present Invention, the selection step can 
be can-ied out either in vitro or in vivo depending upon the particular selection scheme which has been optionally 
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recombination, as reverse reactions of each other 

[0116] Each of the aff sites contains a 15 bp core sequence; individual sequence elennents of functional significance 
lie within, outside, and across the boundaries of this common core (Landy, A., Ann. Rev. Biochem. 5S:913 (1989)). 
Efficient recombination between the various att sites requires that the sequence of the central common region be 
5 identical between the recombining partners, however, the exact sequence is now found to be modifiable. Consequently, 
derivatives of the att site with changes within the core are now discovered to recombine as least as efficiently as the 
native core sequences. 

[0117] Integrase acts to recombine the affP site on bacteriophage lambda (about 240 bp) with the attS site on the 
E. CO// genome (about 25 bp) (Weisberg, R.A. and Landy, A, in Lambda II, p. 211 (1983), Cold Spring Harbor Labora- 
10 tory)), to produce the integrated lambda genome flanked by attL (about 100 bp) and attR (about 160 bp) sites. In the 
absence of Xis (see below), this reaction is essentially irreversible. The integration reaction mediated by integrase and 
IHF works in vitro, with simple buffer containing spermidine. Integrase can be obtained as described by Nash, H.A,, 
Methods of Enzymology T00:21 0-216 (1983). IHF can be obtained as described by Filutowicz, M., et al., Gene 147: 
149-150 (1994). 

'5 [0118] In the presence of the X protein Xis (excise) integrase catalyzes the reaction of afffland affLtoform affPand 
attB, i.e., it promotes the reverse of the reaction described above. This reaction can also be applied in the present 
Invention. 

[0119] Other Recombination Systems. Numerous recombination systems from various organisms can also be 
used, based on the teaching and guidance provided herein. See, e.g., Hoess et al., Nucleic Acids Research 14(6): 

20 2287 (1986); Abremski et al., J. Biol. Chem.261{1):391 (1986); Campbell, J. Bacteriol. f74(23):7495 (1992); Qian et 
al., J. Biol. Chem. 267(1 1 ): 7794 (A 992)\Araki etal.,J. Mol. e/o/. 225(1 ):25 (1992)). Many of these belong to the integrase 
family of recombinases (Argos et al. EMBO J 5:433-440 (1 986)). Perhaps the best studied of these are the Integrase/ 
att system from bacteriophage A, (Landy, A. (1993) Current Opinions in Genetics and Devel. 3:699-707), the Cre/toxP 
system from bacteriophage PI (Hoess and Abremski (1990) In Nucleic Acids and Molecular Biology, vol. 4. Eds.: 

25 Eckstein and Lilley, Berlin-Heidelberg: Springer-Verlag; pp. 90-1 09), and the FLP/FRT system from the Saccharomyces 
cerevisiae 2 \x circle plasmid (Broach et al. Cell 29:227-234 (1 982)). 

[0120] Members of a second family of site-specific recombinases, the resolvase family {e.g., jS, Tn3 resolvase, Hin, 
Gin, and Cin) are also known. Members of this highly related family of recombinases are typically constrained to in- 
tramolecular reactions (e.g., inversions and excisions) and can require host-encoded factors. Mutants have been iso- 
30 lated that relieve some of the requirements for host factors (Maeser and Kahnmann (1991) Mol. Gen. Genet. 230: 
1 70-1 76), as well as some of the constraints of intramolecular recombination. 

[01 21 ] Other site-specific recombinases similar to A. Int and similar to P1 Cre can be substituted for Int and Cre. Such 
recombinases are known. In many cases the purification of such other recombinases has been described in the art. 
In cases when they are not known, cell extracts can be used or the enzymes can be partially purified using procedures 

35 described for Cre and Int. 

[0122] While Cre and Int are described in detail for reasons of example, many related recombinase systems exist 
and their application to the described invention is also provided according to the present invention. The integrase family 
of site-specific recombinases can be used to provide alternative recombination proteins and recombination sites for 
the present invention, as site-specific recombination proteins encoded by bacteriophage lambda, phi 80, P22, P2, 1 86, 

40 P4 and PI . This group of proteins exhibits an unexpectedly large diversity of sequences. Despite this diversity, all of 
the recombinases can be aligned in their C-terminal halves. 

[0123] A 40-residue region near the C terminus is particularly well conserved in all the proteins and is homologous 
to a region near the C temninus of the yeast 2 mu plasmid Flp protein. Three positions are perfectly conserved within 
this family: histidine, arginine and tyrosine are found at respective alignment positions 396, 399 and 433 within the 
''s well-conserved C-terminal region. These residues contribute to the active site of this family of recombinases, and 
suggest that tyrosine-433 forms a transient covalent linkage to DNA during strand cleavage and rejoining. See, e.g., 
Argos, P. etal., EMBO J. 5:433-40 (1986). 

[0124] Alternatively, 1S231 and other Bacillus thuringiensis transposable elements could be used as recombination 
proteins and recombination sites. Bacillus thuringiensis is an entomopathogenic bacterium whose toxicity is due to the 

50 presence in the sporangia of delta-endotoxin crystals active against agricultural pests and vectors of human and animal 
diseases. Most of the genes coding forthese toxin proteins are plasmid-borne and are generally structurally associated 
with insertion sequences (IS231 , 18232, 18240, ISBT1 and ISBT2) and transposons (Tn4430 and Tn5401 ). Several of 
these mobile elements have been shown to be active and participate in the crystal gene mobility, thereby contributing 
to the variation of bacterial toxicity. 

55 [0125] Structural analysis of the iso-IS231 elements indicates that they are related to IS 1151 from Clostridium per- 
frlngens and distantly related to 184 and IS1 86 from Escherichia coll. Like the other IS4 family members, they contain 
a conserved transposase-integrase motif found in other !S families and retroviruses. 

[0126] Moreover, functional data gathered from IS231 A in Escherichia coli indicate a non-replicative mode of trans- 
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recombination, said enliancement selected from the group consisting of substantially (i) favoring excisive integration; 
(it) favoring excisive reconribination; (ii) relieving the requirement for host factors; (iii) increasing the efficiency of said 
Cointegrate DNA or Product DNA formation; and (iv) increasing the specificity of said Cointegrate DNA or Product 
DNA formation. 

5 [0136] The nucleic acid molecule preferably comprises at least one recombination site derived from attB, attP, attL 
or attR. More preferably the att site is selected from att1 , att2, or att3, as described herein. 

[0137] In a preferred embodiment, the core region comprises a DNA sequence selected from the group consisting of: 

10 (a) RKyCTGCTn^mTACNAASTSGB(m-att)(SEQIDNO:l); 

(b) AGCCW<K:TTrYK:TRTACKAACTSGB (m-attB) (SEQ ID 

N0:2); 

(c) GTrCAGCnrCKTRTACNAACTSGB (m-attR) (SEQ ID 

N0:3): 

(d) AGCCWGCTTTCKTOTACNAAGTSGB (m-attL) (SEQ ID 

20 

NO:4); 

<e) GTTCAGCrTrYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

NO:5); 

or a corresponding or complementary DNA or RNA sequence, wherein R=A or G; K=G or T/U; Y^C orT/U; W=A or 
T/U; N=A or C or G or T/U; S=Cor G; and B=C or G or T/U, as presented in 37 C.F.R. §1 .822, wherein the core region 
does not contain a stop codon in one or more reading frames. 
30 [0138] The core region also preferably comprises a DNA sequence selected from the group consisting of: 

(a) ACCCTGCTn"lJTQTACAAACTTGT(fflttBl)(SEQIDNO:6); 

(b) ACCCTCCTTTCTTGTACAAACrTGT<attB2)(SEQIDNO:7); 

(c) AC(X:AGCTITCrTGTACAAACTTGT (altB3) (SEQ ID NO:8); 

(d) GTTCAGCn'l'i'i 1 GTACAAACTTGT (attRl) (SEQ IDNO:9); 

(e) GTTCAGCnTCTTGTACAAACTTGT (attR2) (SEQID NO:10); 

(f) GTTCAGCTTTCnTGTACAAAGTTGG (attR3) (SEQ ID 
N0:11); 
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Example 1: Recomblnational Cloning Using Cre and Cre & Int 

[0149] Two pairs of plasmids were constructed to do the m vitro recombinational cloning metliod in two different 
ways. One pair, pEZC705 and pEZC726 (Figure 2A), was constructed with loxP and aff sites, to be used with Cre and 

5 A. integrase. The other pair, pEZC602 and pEZC629 (Figure 3A), contained the loxP (wild type) site for Cre, and a 
second mutant lox site, loxP 51 1, which differs from loxP in one base (out of 34 total). The minimunn requirement for 
recombinational cloning of the present invention is two recombination sites in each plasmid, in general Xand Y, and 
X' and Y'. Recombinational cloning tal<es place if either or both types of site can recombine to form a Cointegrate (e. 
g. Xan6 X), and if either or both (but necessarily a site different from the type forming the Cointegrate) can recombine 

10 to excise the Product and Byproduct plasmids from the Cointegrate (e.g. Vand Y"). It is important that the recombination 
sites on the same plasmid do not recombine. It was found that the present recombinational cloning could be done with 
Cre alone. 

Cre-Only 

15 

[0150] Two plasmids were constructed to demonstrate this conception (see Figure 3A). pEZC629 was the Vector 
Donor plasmid. It contained a constitutive drug marker (chloramphenicol resistance), an origin of replication, /oxP and 
loxP 511 sites, a conditional drug marker (kanamycin resistance whose expression is controlled by the operator/pro- 
moter of the tetracycline resistance operon of transposon Tn1 0) and a constitutively expressed gene for the tet repres- 
20 sor protein, tetR. E. coli cells containing pEZC629 were resistant to chloramphenicol at 30 jig/ml, but sensitive to 
kanamycin at 100 tig/ml. pEZC602 was the Insert Donor plasmid, which contained a different dnjg marker (ampicillin 
resistance), an origin, and loxP and loxP 51 1 sites flanking a multiple cloning site. 
[0151] This experiment was comprised of two parts as follows: 

25 Part I: About 75 ng each of pEZC602 and pEZC629 were mixed in a total volume of 30 n.1 of Cre buffer (50 mM 

Tris-HCI pH 7.5, 33 mM NaCI, 5 mM spermidine-HCI, 500 jj,g/ml bovine serum albumin). Two 10 p.1 aliquots were 
transferred to new tubes. One tube received 0.5 ^lI of Cre protein (approx. 4 units per ^.1; partially purified according 
to Abremski and Hoess, J. Biol. Chem. 25P:1509 (1984)). Both tubes were incubated at 37°C for 30 minutes, then 
70°C for 10 minutes. Aliquots of each reaction were diluted and transformed into DH5a. Following expression, 

30 aliquots were plated on 30 ^g/ml chloramphenicol; 100 ng/ml ampicillin plus 200 ng/ml methicillin; or 100 ng/ml 

kanamycin. Results: See Table I.The reaction without Cre gave 1,11x10^ ampicillin resistant colonies (from the 
Insert Donor plasmid pEZC602); 7.8x1 0^ chloramphenicol resistant colonies (from the Vector Donor plasmid 
pEZC629); and 140 kanamycin resistant colonies (background). The reaction with added Cre gave 7.5x1 0^ amp- 
icillin resistant colonies (from the Insert Donor plasmid pEZC602); 6.1x105 chloramphenicol resistant colonies 

35 (from the Vector Donor plasmid pEZC629); and 760 kanamycin resistant colonies (mixture of background colonies 

and colonies from the recombinational cloning Product plasmid). Analysis: Because the number of colonies on 
the kanamycin plates was much higher in the presence of Cre, many or most of them were predicted to contain 
the desired Product plasmid. 



45 



\ Enzyme 


Ampicillin 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


1.1x106 


7.8x105 


140 


140/7.8x105= 0.02% 


Cre 


7.5x105 


6.1x105 


760 


760/6.1x105= 0.12% 



[0152] Part li: Twenty four colonies from the "+ Cre" kanamycin plates were picked and inoculated into medium 
containing 100 ^.g/ml kanamycin. Minipreps were done, andthe mintprep DNAs, uncut or cut with Sma\ orH/ndlll, were 
electrophoresed. Results: 19 of the 24 minipreps showed supercoiled plasmid of the size predicted for the Product 
50 plasmid. All 19 showed the predicted Smal and H/ndlll restriction fragments. Analysis: The Cre only scheme was 
demonstrated. Specifically, it was determined to have yielded about 70% (1 9 of 24) Product clones. The efficiency was 
about 0.1% (760 kanamycin resistant clones resulted from 6.1x105 chloramphenicol resistant colonies). 

Cre Plus Integrase 

55 

[0153] The plasmids used to demonstrate this method are exactly analogous to those used above, except that 
pEZC726, the Vector Donor plasmid, contained an attP site in place of loxP 511, and pEZC705, the Insert Donor 
plasmid, contained an attB site in place of loxP 511 (Figure 2A). 
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(5026 base pairs), CMVProd. Restriction digestion with Nco\ gave the fragments predicted for the chloramphenicol 
acetyl transferase cloned downstream of the CMV promoter for all six plasmids. 



Example 3: Subcloned DNA Segments Flanked by attB Sites Without Stop Codons 



Part I: Background 



[0156] The above examples are suitable for transcriptional fusions, in which transcription crosses recombination 
sites. However, both attR and loxP sites contain multiple stop codons on both strands, so translational fusions can be 
difficult, where the coding sequence must cross the recombination sites, (only one reading frame is available on each 
strand of loxP sites) or impossible (in attR or attL). 

[0157] A principal reason for subclonlng is to fuse protein domains. For example, fusion of the glutathione S-trans- 
ferase (GST) domain to a protein of interest allows the fusion protein to be purified by affinity chromatography on 
glutathione agarose (Pharmacia, Inc., 1 995 catalog), if the protein of interest is fused to runs of consecutive histidines 
(for example Hls6), the fusion protein can be purified by affinity chromatography on chelating resins containing metal 
ions (Qiagen, Inc.). It is often desirable to compare amino terminal and carboxy terminal fusions for activity, solubility, 
stability, and the like. 

[0158] The attB sites of the bacteriophage X integration system were examined as an alternative to loxP sites, be- 
cause they are small (25 bp) and have some sequence flexibility (Hash, H.A. et al., Proc. Natl. Acad. Sci. USA 84: 
4049-4053 (1 987). It was not previously suggested that multiple mutations to remove all stop codes would result in 
useful recombination sites for recombinational subclonlng. 

[0159] Using standard nomenclature for site specific recombination in lambda bacteriophage (Weisber, in Lambda 
III, Hendrix, et al., eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)), the nucleotide regions that 
participate in the recombination reaction in an E. co// host cell are represented as follows: 



attP --Pl--Hl--P2--X--H2--C-0-C--H'--P'l--P'2--P'3-- 

+ 

attB __B-0-B'-- 

Int, IHP n Xis, Int, IHF 

attR --P1--H1--P2--X--H2--C-0-B' -- 

+ 

attL --B-0-C--H' --P'l--P'2--P"3--, 

Where: O represents the 15 bp core DNA sequence found in both the phage and E. co// genomes; B and B' represent 
approximately 5 bases adjacent to the core in the E. co// genome; and P1, H1, P2, X, H2, C, C, H', P'1, P'2, and P'3 
represent known DNA sequences encoding protein binding domains in the bacteriophage X genome. 
[0160] The reaction is reversible in the presence of the protein Xis (excisionase); recombination between attL and 
attR precisely excise the X genome from its integrated state, regenerating the circular A. genome containing attP and 
the linear E. co// genome containing attB. 



Part 11: Construction and Testing of Plasmids Containing Mutant att Sites 

so [0161] Mutant attL and attR sites were constructed. Importantly, Landy et al. (Ann. Rev. Biochem. 58:913 (1989)) 
observed that deletion of the P1 and HI domains of attP facilitated the excision reaction and eliminated the integration 
reaction, thereby making the excision reaction irreversible. Therefore, as mutations were introduced in attR, the PI 
and H1 domains were also deleted. attR sites in the present example lack the PI and H1 regions and have the Ndel 
site removed (base 27630 changed from C to G), and contain sequences corresponding to bacteriophage X coordinates 

55 2761 9-27738 (GenBank release 92.0, bg:LAMCG, "Complete Sequence of Bacteriophage Lambda"). 
[0162] The sequence of attB produced by recombination of wild type attL and attR sites is: 
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Table 3 (continued) 



Vector donor att site 


Gene donor att site 


# of l<anamycin resistant colonies* 




attLwt(pEZC1313) 
attL2 (PEZC1317) 
attL2 (PEZC1321) 


0 
0 

209 



{*1% of each transformation was spread on a kanamycin plate.) 



[0166] The above data show that whereas the wild type att and att1 sites recombine to a small extent, the atti and 
att2 sites do not recombine detectably with each other 

[0167] Part III. Recombination was demonstrated when the core region of both attB sites flanking the DNA segment 
of interest did not contain stop codons. The physical state of the participating plasmids was discovered to influence 
recombination efficiency. 

[0168] The appropriate att sites were moved into pEZC705 and pEZC726 to make the plasmids pEZC1405 (Figure 
5G) (attR1 and attR2) and pEZCI 502 (Figure 5H) (attL1 and attL2). The desired DNA segment in this experiment was 
a copy of the chloramphenicol resistance gene cloned between the two attL sites of pEZC1 502. Pairs of plasmids were 
recombined in vitro using Int, Xis, and IHF (no Cre because no loxP sites were present). The yield of desired kanamycin 
resistant colonies was determined when both parental plasmids were circular, or when one plasmid was circular and 
the other linear as presented in Table 4: 



Table 4 



Vector donor'' 


Gene donor"" 


Kanamycin resistant colonies^ 


Circular pEZC1 405 


None 


30 


Circular pEZCI 405 


Circular pEZC1502 


2680 


Linear pEZC1 405 


None 


90 


Linear pEZC1 405 


Circular pEZC1502 


1 72000 


Circular pEZCt 405 


Linear pEZCI 502 


73000 



30 DMAs were purified with Qiagen columns, concentrations determined by A260, and linearized with Xba I (pEZC1405) or AlwN I (pE2C1 502). Each 

reaction contained 100 ng of the indicated DNA. All reactions (10 ^1 total) contained 3 jil of enzyme mix (Xis, Int, and IHF). After incubation (45 
minutes at 25", 10 minutes at 65"), one ^il was used to transform E. co//' DH5ct cells. 

^Number of colonies expected if the entire transformation reaction (1 ml) had been piated. Either 100 jii or 1 (i! of the transformations were actually 
plated. 

35 [0169] Analysis: Recombinational cloning using mutant attR and attL sites was confinned. The desired DNA seg- 
ment is subcloned between attB sites that do not contain any stop codons in either strand. The enhanced yield of 
Product DNA (when one parent was linear) was unexpected because of earlier observations that the excision reaction 
was more efficient when both participating molecules were supercoiled and proteins were limiting (Nunes-Duby etal., 
Cell 50:779-788 (1987). 

40 

Example 4: Demonstration of Recombinational Cloning Without Inverted Repeats 
Part I: Rationale 

[0170] The above Example 3 showed that plasmids containing inverted repeats of the appropriate recombination 
sites (for example, attL1 and attL2 in plasmid pEZC1502) (Figure 5H) could recombine to give the desired DNA segment 
flanked by attB sites without stop codons, also in inverted orientation. A concern was the in vivo and in vitro influence 
of the inverted repeats. For example, transcription of a desired DNA segment flanked by attB sites in inverted orientation 
could yield a single stranded RNA molecule that might form a hairpin structure, thereby inhibiting translation. 
^° [0171] Inverted orientation of similar recombination sites can be avoided by placing the sites in direct repeat arrange- 
ment att sites. If parental plasmids each have a wild type attL and wild type attR site, in direct repeat the Int, Xis, and 
IHF proteins will simply remove the DNA segment flanked by those sites in an intramolecular reaction. However, the 
mutant sites described in the above Example 3 suggested that it might be possible to inhibit the intramolecular reaction 
while allowing the intennolecular recombination to proceed as desired. 
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[0179] In the general recombinational cloning scheme, in which the vector donor contains two segments C and D 
separated by recombination sites, selection for the desired product depends upon selection for the presence of segment 
D, and the absence of segment C. In the original Example segment D contained a drug resistance gene (Km) that was 
negatively controlled by a repressor gene found on segment C. When C was present, cells containing D were not 
resistant to l<anamycin because the resistance gene was turned off. 

[01 80] The Dpn I gene is an example of a toxic gene that can replace the repressor gene of the above embodiment. 
If segment C expresses the Dpn I gene product, transforming plasmid CD into a dam+ host l<ills the cell. If segment D 
is transferred to a new plasmid, for example by recombinational cloning, then selecting for the drug marker will be 
successful because the toxic gene is no longer present. 

Part II: Construction of a Vector Donor Using Dpn I as a Toxic Gene 

[0181] The gene encoding Dpn I endonuclease was amplified by PGR using primers 5'CCA CCA CAA ACG CGT 
CCA TGG AAT TAG ACT TTA ATT TAGS' (SEQ. ID NO: 17) and 5'CCA CCA CAA GTC GAC GCA TGC CGA CAG 
CCTTCC AAA TGT3' (SEQ. ID NO: 18) and a plasmid containing the Dpn I gene (derived from plasmids obtained 
from Sanford A. Lacks, Brookhaven National Laboratory, Upton, New York; also available from American Type Culture 
Collection as ATCC 67494) as the template. 

[0182] Additional mutations were introduced into the B and B' regions of attL and attR, respectively, by amplifying 
existing attL and attR domains with primers containing the desired base changes. Recombination of the mutant attLS 
(made with oligo Xis115) and attRS (made with oligo Xis112) yielded attBS with the following sequence (differences 
from attBI in bold): 

B OB' 
ACCCA GCTTTCTTGTACAAA GTGGT (SEQ. ID NO: 8) 

tGggT cgaaaGaacatgttt CaCca 



30 The attLS sequence was cloned in place of attL2 of an existing Gene Donor plasmid to give the plasmid pEZC2901 
(Figure 7A). The attRS sequence was cloned in place of attR2 in an existing Vector Donor plasmid to give plasmid 
pEZC2913 (Figure 7B) Dpn I gene was cloned into plasmid pEZC2913 to replace the tet repressor gene. The resulting 
Vector Donor plasmid was named pEZC31 01 (Figure 7C). When pEZCSI 01 was transformed into the dam- strain SCS 
110 (Stratagene), hundreds of colonies resulted. When the same plasmid was transformed into the dam+ strain DH5a, 

35 only one colony was produced, even though the DH5a cells were about 20 fold more competent than the SCS1 10 
cells. When a related plasmid that did not contain the Dpn I gene was transfomned into the same two eel) lines, 28 
colonies were produced from the SCS1 1 0 cells, while 448 colonies resulted from the DH5a cells. This is evidence that 
the Dpn I gene is being expressed on plasmid pEZCSIQI (Figure 7C), and that it is killing the dam+ DH5a cells but 
not the dam- SCS1 10 cells. 

40 

Part III: Demonstration of Recombinational Cloning Using Dpn I Selection 

[0183] A pair of plasmids was used to demonstrate recombinational cloning with selection for product dependent 
upon the toxic gene Dpn I. Plasmid pEZCSI 01 (Figure 7C) was linearized with Mlu I and reacted with circular plasmid 
45 pEZC2901 (Figure 7A). A second pair of plasmids using selection based on control of drug resistance by a repressor 
gene was used as a control: plasmid pEZC 1802 (Figure 7D) was linearized with Xba I and reacted with circular plasmid 
pEZC1502 (Figure 5H). Eight microliter reactions containing the same buffer and proteins Xis, Int, and IHF as in previous 
examples were incubated for 45 minutes at 25°C, then 10 minutes at 75°C, and 1 jil aliquots were transformed into 
DH5a (i.e., dam+) competent cells, as presented in Table 6. 

50 

Table 6 



Reaction # 


Vector donor 


Basis of selection 


Gene donor 


Colonies 


1 


pEZCSI 01/Mlu 


Dpn 1 toxicity 




3 , 


2 


pEZCSI 01/Mlu 


Dpn 1 toxicity 


Circular pEZC2901 


4000 


3 


pEZC1802/Xba 


Tet repressor 




0 
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Part III: PCR of CAT and phoA Genes 



[0190] Primers were obtained from Life Technologies, Inc., to amplify the chloramphenicol acetyl transferase (CAT) 
gene from plasmld pACYC184, and phoA, the alkaline phosphatase gene from E. coli. The primers had 12-base 5' 

5 extensions containing uracil bases, so that treatment of PCR products with uracil DNA glycosylase (UDG) would weak- 
en base pairing at each end of the DNAs and allow the 3' strands to anneal with the protruding 3' ends of the rf1 , 2, 
and 3 vectors described above. The sequences of the primers (all written 5' - 3') were: CAT left, UAU UUU CAG GGU 
ATG GAG AAA AAA ATC ACT GGA TAT ACC (SEQ. ID NO:27); CAT right, UCC CAC UUA UUA CGC CCC GCC 
CTG CCA CTC ATC (SEQ. ID NO:28); phoA left, UAU UUU CAG GGU ATG CCT GTT CTG GAA AAC CGG (SEQ. 

10 ID NO:29); and phoA right, UCC CAC UUA UUA TTT CAG CCC CAG GGC GGC TTT C (SEQ. ID NO:30). The 
primers were then used for PCR reactions using known method steps (see, e.g., U.S. patent No. 5,334,515, entirely 
incorporated herein by reference), and the polymerase chain reaction amplification products obtained with these prim- 
ers comprised the CAT or phoA genes with the initiating ATGs but without any transcriptional signals. In addition, the 
uracil-containing sequences on the amino termini encoded the cleavage site for TEV protease (Life Technologies, Inc.), 

15 and those on the carboxy tenninal encoded consecutive TAA nonsense codons. 

[0191] Unpurified PCR products (about 30 ng) were mixed with the gel purified, linear rfl , rf2, or rf3 cloning vectors 
(about 50 ng) in a 1 0 ^il reaction containing 1 X REact 4 buffer (LTI) and 1 unit UDG (LTI). After 30 minutes at 37°C, 1 
li\ aliquots of each reaction were transfonned into competent E. co// DH5a cells (LTI) and plated on agar containing 
50 (i.g/ml kanamycin. Colonies were picked and analysis of miniprep DNA showed that the CAT gene had been cloned 

20 in reading frame 1 (pEZC3601)(Figure 8C), reading frame 2 (pEZC3609)(Figure BD) and reading frame 3 (pEZC361 7) 
(Figure 8E), and that the phoA gene had been cloned in reading frame 1 (pEZC3606)(Figure 8F), reading frame 2 
(pEZC3613)(Figure8G) and reading frames (pEZC3621)(Figure 8H). 

Part IV: Subcloning of CAT or phoA from UDG Cloning Vectors into a GST Fusion Vector 

25 

[01 92] Plasmids encoding fusions between GST and either CAT or phoA in all three reading frames were constructed 
by recombinational cloning as follows. Miniprep DNA of GST vector donor pEZC3501 (Figure 8B) (derived from Phar- 
macia plasmid pGEX-2TK as described above) was linearized with Cla I. About 5 ng of vector donor were mixed with 
about 1 0 ng each of the appropriate circular gene donor vectors containing CAT or phoA In 8 ^.1 reactions containing 
30 buffer and recombination proteins Int, Xis, and IHF (above). After incubation, 1 ^il of each reaction was transformed 
into E. CO// strain DHSa and plated on ampicillin, as presented in Table 7. 



Table 7 



DNA 


Colonies (10% of each transformation) 


Linear vector donor (pEZC3501/Cla) 


0 


Vector donor + CAT rfl 


110 


Vector donor + CAT rf2 


71 


Vector donor + CAT rf3 


148 


Vector donor + phoA rfl 


121 


Vector donor + phoA rf2 


128 


Vector donor + phoA rf3 


31 



Part V: Expression of Fusion Proteins 



[0193] Two colonies from each transformation were picked into 2 ml of rich medium (CircleGrow, Biol 01 Inc.) in 17 
X 100 mm plastic tubes (Falcon 2059, Becton Dickinson) containing 100 (xg/ml ampicillin and shaken vigorously for 
about 4 hours at 37°C, at which time the cultures were visibly turbid. One ml of each culture was transferred to a new 
tube containing 10 jil of 10% (w/v) IPTG to induce expression of GST. After 2 hours additional incubation, all cultures 
bad about the same turbidity; the A600 of one culture was 15. Cells from 0.35 ml each culture were harvested and 
treated with sample buffer (containing SDS and p-mercaptoethanol) and aliquots equivalent to about 0.15 A600 units 
of cells were applied to e Novex 4-20% gradient polyacrylamide gel. Following electrophoresis the gel was stained with 
Coomassie blue. 

[0194] Results: Enhanced expression of single protein bands was seen for all 12 cultures. The observed sizes of 
these proteins correlated well with the sizes predicted for GST being fused (through attB recombination sites without 
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tion protein. 

14. A method according to Claim 1 3 wherein the first recombination protein recombines the first and third recombination 
sites and the second recombination protein recombines the second and fourth recombination sites. 

5 

15. A method according to Claim 14 wherein the first and second recombination proteins are used consecutively. 

16. A method as claimed in Claim 1, wherein said recombination protein is Int. 

10 17. A method as claimed in Claim 13, wherein the recombination protein comprises Int and IHF and, optionally, Xis. 

18. Use of a nucleic acid comprising a first DNA segment and a second DNA segment, wherein the first or second 
segment is flanked by at least a first and a second engineered recombination sites that do not recombine with 
each other, in an in vitro method according to Claim 1 . 

15 

19. Use according to Claim 18 wherein the first and second engineered recombination sites have been engineered 
so as to have one or more properties selected from:- 

(a) enhanced specificity or efficiency of recombination; 
20 (b) decreased reverse recombination; 

(c) absence of translation stop codon; and 

(d) absence of inverted repeats so as to avoid hairpin formation. 

20. Use according to Claim 1 9 wherein enhanced specificity or efficiency of recombination is conferred by one or more 
25 properties selected from:- 

(a) favoured excisive integration; 

(b) favoured excisive recombination; and 

(c) relieved requirement for host factors. 

30 

21. Use according to Claim 19 or 20 wherein the recombination sites are engineered to confer enhanced specificity 
or efficiency of recombination in vitro. 

22. Use according to any of Claims 1 8-21 , wherein said first recombination site is selected from the group consisting 
35 of an att site and a /ox site. 

23. Use according to Claim 22, wherein said second recombination site is selected from the group consisting of an att 
site and a /ox site. 

40 24. Use according to any of Claims 1 8-21 , wherein a core region of at least one of the recombination sites comprises 
a DNA sequence selected from the group consisting of: 

(a) RKYCWGCTTTYKTRTACNAASTSGB (m-att) (SEQ ID NO:l); 



(b) AGCCWGCTTTYKTRTACNAACTSGB (m-attB) (SEQ ID NO:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID NO:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-atlL) (SEQ ID NO:4); 

(e) GTTCAGCTTTYKTRTACNAAGTSGB (m-attPl) (SEQ ID NO:5); 



and a corresponding or complementary DNA or RNA sequence, wherein R=A or G; K=G or T/U; Y=C orT/ 
U; W=A or T/U; N=A or C or G or T/U; S=C or G; and B=C or G or T/U. 
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consisting of a cloning site, a restriction site, a promoter, an operon, an origin of replication, a functional DNA, an 
antisense RNA, a PGR fragment, a protein or a protein fragment. 

32. Use according to Claim 30, wherein said first and second engineered recombination sites have been engineered 
to enhance recombination efficiency. 

33. A kit comprising a container being compartmentalized and including in close confinement therein a first compart- 
ment containing a nucleic acid as defined in any of Claims 1 8-32, and at least one additional compartment con- 
taining at least one recombination protein capable of recombining a DNA segment comprising at least one of said 
recombination sites. 

34. A kit according to Claim 33, comprising a first compartment containing a nucleic acid that is a Vector Donor DNA, 
a second compartment containing a nucleic acid that is an Insert Donor DNA molecule, and said additional com- 
partment containing said at least one recombination protein. 

35. An in vitro method for apposing an expression signal and a gene or partial gene comprising: 

(a) mixing a first nucleic acid molecule comprising said expression signal and at least a first recombination 
site, and a second nucleic acid molecule comprising said gene or partial gene and at least a second recom- 
bination site; and 

(b) incubating said mixture in the presence of at least one recombination protein under conditions sufficient 
to cause recombination of at least said first and second recombination sites thereby apposing said expression 
signal and said gene or partial gene. 

36. A method according to Claim 35, wherein said at least one recombination protein is encoded by a bacteriophage 
selected from the group consisting of bacteriophage lambda, phiSO, P22, P2, 186, P4 and P1 . 

37. A method according to Claim 35 wherein said at least one recombination protein is encoded by bacteriophage 
lambda. 

38. A method according to any Claims 35-37 using a first nucleic acid as defined in any of Claims 1 8-32 and a second 
nucleic acid as defined in any of Claims 18-32. 

39. A composition comprising a nucleic acid molecule as defined in any of Claims 18-32 and a first recombination 
protein. 

40. A composition according to Claim 39 comprising nucleic acid molecules that are Vector Donor DNAs and Insert 
Donor DNAs, and said first recombination protein. 

41. A composition according to Claim 39 or 40 further comprising a second recombination protein different from the first. 

42. Use of a methodaccordingtoany of Claims 1 -17 for changing vectors, apposing promoters with genes, constructing 
genes forfusion proteins, changing copy number, changing repiicons, cloning into phages orcloning PCR products, 
genomic DNAs or cDNAs, 



Patentanspruche 

1. Verfahren zur Herstellung eines Fusionsprodukt-DNA-Molekuls, umfassend Kombinieren in vitro: 

(i) ein Insertion-Donor-DNA-Molekiil, das ein gewiinschtes DNA-Segment umfasst, das durch eine erste Re- 
kombinationsstelle und eine zweite Rekombinationsstelle fiankiert ist, wobei die erste und die zwelte Rekom- 
binationsstelle nicht miteinander rekombinieren; 

(ii) ein Vektor-Donor-DNA-Molekul, das eine dritte Rekombinationsstelle und eine vierte Rekombinationsstelle 
enthalt, wobei die dritte und die vierte Rekombinationsstelle nicht miteinander rekombinieren; und 

(iii) wenigstens ein spezifisches Rekombinationsprotein, dasfahig ist, die erste und dritte RekombinationsstellG 
und die zweite und vierte Rekombinationsstelle zu rekombinieren; 
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puliert wurden, um eine oder mehrere Eigenschaften aufzuweisen, ausgewahit aus: 

(a) verbesserter Spezifizitat oder Rekombinationseffizient 

(b) verringerter reverser Rekombination; 

(c) Abwesenhelt von Translationsstopcodons; und 

(d) Abwesenheit von invertierten Wiederholungen, um Haarnadelbildung zu vernneiden. 

20. Verwendung gemaB Anspruch 19, bei der die verbesserte Spezifizitat oder Rekombinationseffizienz durch eine 
Oder mehrere Eigenschaften ubertragen werden, ausgewahit aus: 

(a) bevorzugte exzisive Integration; 

(b) bevorzugte exzisive Rekombination; und 

(c) erieichterte Anforderung fiir Wirtsfaktoren. 

21. Verwendung gema3 Anspruch 19 oder 20, bei derdie Rekombinationsstellen genmanipuliertsind, um verbesserte 
Spezifizitat oder Rekombinationseffizient in vitro zu ubertragen. 

22. Verwendung gemaB irgendeinem der Anspriiche 18-22, beiderdieerste Rekombinationsstelle ausgewahit ist aus 
der Gruppe, bestehend aus einer aff-Stelle und einer /ox-Stelle. 

23. Verwendung gemaB Anspruch 22, bei der die zweite Rekombinationsstelle ausgewahit ist aus der Gruppe, beste- 
hend aus einer att-Stelle und einer /ox-Stelle. 

24. Verwendung gemaB irgendeinem der Anspriiche 18-21, bei der eine Kernregion von wenigstens einer der Rekom- 
binationsstellen eine DNA-Sequenz umfasst, die ausgewahit ist aus der Gruppe, bestehend aus: 



und einer entsprechenden oder komplementaren DNA- oder RNA-Sequenz, wobei R=A oder G; K=G oderT/U; 
Y=C oderT/U; W=A oderT/U; N=A oder C oder G oderT/U; S=C oder G; und B=C oder G oderT/U. 

25. Verwendung gemaB irgendeinem der Anspruche 18 bis 21, bei der eine Kernregion von wenigstens einer der 
Rekombinationsstellen eine DNA-Sequenz umfasst, die ausgewahit ist aus der Gruppe, bestehend aus: 



/(b) 
(c) 
(d) 
(e) 



(a) RKYCWGCTTTYKTRTACNAASTSGB (m-att) (SEQ ID Nr.; i); 
, AGCCWGCrnryKTRTACNAACTSGB (m-attB) (SEQ Id 'nf.: 2); 

GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID ! Nr.: 3); 
I AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID Nr.: 4); 
f GTTCAGCTTTYKTRTACNAAGTSGB (m-attPl) {SEQ ID Nr.: 5); 
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ist und ein gewiinschtes DNA-Molekiil umfasst. 

31 . Verwendung gemaB Anspruch 30, bei der das gewiinschte DNA-Segment fur wenigstens eines kodiert, welches 
ausgewahit ist aus der Gruppe, bestehend aus einer Klonierungsstelle, einer Restriktionsstelle, einem Promoter, 

5 einem Operon, einem Replikationsursprung, einer funktionellen DNA, einer Antisense-RNA, einem PCR-Frag- 

ment, einem Protein oder einem Proteinfragment. 

32. Verwendung gemaB Ansprucli 30, bei der die erste und zweite genmanipulierte Rekombinationssteiie genmani- 
puliert wurden, um Rekombtnationseffizienz zu verbessern. 

10 

33. Kit, welcher ein Gefa3 umfasst, das unterteilt ist und in dicliter Begrenzung darin eine erste Kammer, die eine wie 
in irgendeinem der Anspruclie 18-32 definierte Nukleinsaure entlialt, und wenigstens eine zusatziiche Kammer 
einschlieBt, die wenigstens ein Rekombinationsprotein enthalt, dasfahig ist, ein DNA-Segment zu rekombinieren, 
das wenigstens eine der Rekombinationsstellen umfasst. 

15 

34. Kit gema3 Anspruch 33, weicher eine erste Kammer, die eine Nukleinsaure enthalt, die eine Vektor-Donor-DNA 
ist, und eine zweite Kammer umfasst, die eine Nukleinsaure enthalt, die ein insertion-Donor-DNA-Molekiil ist, 
wobei die zusatziiche Kammer das wenigstens eine Rekombinationsprotein enthalt. 

20 35. In v/lro Verfahren zum Appositionieren eines Expressionssignals und eines Gens oder partiellen Gens, das um- 
fasst: 

(a) Mischen eines ersten Nukleinsauremolekuls, das das Expressionsignal und wenigstens eine erste Rekom- 
binationssteiie umfasst, und eines zweiten Nukleinsauremolekuls, das das Gen oder partielle Gen und we- 

25 nigstens eine zweite Rekombinationssteiie umfasst; und 

(b) Inkubieren der Mischung in der Gegenwart von wenigstens einem Rekombinationsprotein unter ausrei- 
chenden Bedingungen, um Rekombination von wenigstens der ersten und zweiten Rekombinationssteiie zu 
veranlassen, wobei dadurch das Expressionssignal und das Gens oder partielle Gen appositioniert werden. 

30 36. Verfahren gemaB Anspruch 35, bei dem das wenigstens eine Rekombinationsprotein durch einen Bakteriophage 
kodiert wird, der ausgewahit ist aus der Gruppe, bestehend aus Bakteriophage Lambda, phiBO, P22, P2, 1 86, P4 
und PI . 

37. Verfahren gemaB Anspruch 35, bei dem das wenigstens eine Rekombinationsprotein durch Bakteriophage Lamb- 
35 da kodiert wird. 

38. Verfahren gemaB irgendeinem der Anspriiche 35-37 unter Verwendung einer ersten, wie in irgendeinem der An- 
spruche 18-32 definierten Nukleinsaure und einer zweiten, wie in irgendeinem der Anspruche 18-32 definierten 
Nukleinsaure. 

40 

39. Zusammensetzung, die ein wie in irgendeinem der Anspriiche 18-32 definiertes Nukleinsauremolekiii und ein 
erstes Rekombinationsprotein umfasst. 

40. Zusammensetzung gemaB Anspruch 39, die Nukleinsauremolekiile, die Vektor-Donor-DNA's und Insertion-Do- 
45 nor-DNA's sind, und das erste Rekombinationsprotein umfasst. 

41 . Zusammensetzung gemaB Anspruch 39 oder 40, die weiterhin ein zweites, zu dem ersten verschiedenes Rekom- 
binationsprotein umfasst. 

50 42. Verwendung eines Verfahrens gemaB irgendeinem der Anspruche 1 -1 7zum Andern von Vektoren, Appositionieren 
von Promotoren mit Genen, Konstruieren von Genen fiir Fusionsproteine, Andern der Kopienanzahl, Andern von 
Replikons, Klonieren in Phagen oder Klonieren von PCR-Produkten, Genom-DNA's odercDNA's. 



55 Revendications 

1 . Precede de fabrication d'une molecule d'ADN fomnant Cointegrat, comprenant la combinaison in vitro 
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16. Proc6cl6 selon la revendication 1 , dans lequel ladite prot6ine de recombinaison est Int. 

17. Proc6d6 selon la revendication 13, dans lequel la prot6ine de recombinaison comprend Int et IHF et, facultative- 
ment, Xis. 

18. Utilisation d'un acide nucleique comprenant un premier segment d'ADN et un second segment d'ADN, dans la- 
quelle solt le premier ou le second segment, est flanque d'au moins un premier et d'un deuxieme sites de recom- 
binaison modifies qui ne se recombinant pas I'un avec I'autre, dans un proc6d6 in vitro selon ta revendication 1 . 



19. Utilisation seion ia revendication 18, dans laquelle les premier et deuxieme sites de recombinaison modifies ont 
6te modifies de fagon a presenter une ou plusieurs des proprietes suivantes : 

(a) meilleure sp6ciflcite ou efficacit6 de recombinaison ; 

(b) recombinaison inverse redulte ; 

(c) absence de codon d'arret de traduction ; et 

(d) absence de segments a repetitions inversees, de fapon a eviter la formation d'§plngles a cheveux. 

20. Utilisation selon la revendication 1 9, dans laquelle soit une meilleure specificite ou efficacit6 de recombinaison est 
conferee par une ou plusieurs des proprietes suivantes : 



(a) integration excisive favoris^e ; 

(b) recombinaison excisive favorisee ; et 

(c) exigence redulte pour les facteurs de I'hdte. 

25 21. Utilisation selon la revendication 19 ou 20, dans laquelle les sites de recombinaison sont modifies pour conf6rer 
une meilleure specificite ou efficacit§ de recombinaison in vitro. 

22. Utilisation selon i'une quelconque des revendications 18 a 21 , dans laquelle ledit premier site de recombinaison 
est choisi dans le groupe constitue d'un site aff et d'un site lox. 

30 

23. Utilisation selon la revendication 22, dans laquelle ledit deuxieme site de recombinaison est choisi dans le groupe 
constitue d'un site affet d'un site /ox. 

24. Utilisation selon I'une quelconque des revendications 18 6 21 , dans laquelle une region centrale d'au moins I'un 
35 des sites de recombinaison comprend une sequence d'ADN choisie dans le groupe constitue par : 





(a) 


RKyCWGCTTTYKTRTACNAASTSGB 


<m- 


-att) 


(SEQ ID N«:l) ; 


40 


(b) 


AGCCWGCTTTYKTRTACNAACTSGB 


(m- 


■attB) 


(SEQ ID NO:2) ; 




(c) 


GTTCAGCTTTCKTRTACNAACTSGB 


(m- 


•attR) 


(SEQ ID N«:3) ; 




(d) 


AGCCWGCTTTCKTRTACNAAGTSGB 


(m- 


•attL) 


(SEQ ID N« :4) ; 


45 


(e) 


GTTCAGCTTTYKTRTACNAAGTSGB 


(m- 


attPl} (SEQ ID N'zS) .; 



et une sequence d'ADN ou d'ARN correspondante ou complementaire, dans laquelle R = AouG;K = Gou T/U ; 
Y = C ou T/U ; W = A ou T/U ; N = A ou C ou G ou T/U ; S = C ou G ; et B = C ou G ou T/U. 

50 

25. Utilisation selon I'une quelconque des revendications 18 a 21 , dans laquelle une region centrale d'au moins I'un 
des sites de recombinaison comprend une sequence d'ADN choisie dans le groupe constitue par : 
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un ADN fonctionnel, un ARN antisens, un fragment d'ACP, une prot6ine ou un fragment de prot§ine. 

32. Utilisation selon la revendication 30, dans laquelle lesdits premier et deuxi^me sites de recombinalson modifies 
ont 6te modifies pour ameliorer I'efficacit^ de recombinaison. 

33. Kit comprenant un recipient compartiment6 et renfermant dans un confinement 6tanche un premier compartiment 
contenant un acide nucl§ique tel que d6fini dans I'une quelconque des revendications 18 & 32, et au moins un 
compartiment supplemental re contenant au moins une prot6ine de recombinaison capable de recombiner un seg- 
ment d'ADN comprenant au moins I'un desdits sites de recombinaison. 

34. Kit selon la revendication 33, comprenant un premier compartiment contenant un acide nucldique qui est un ADN 
Donneur de Vecteur, un second compartiment contenant un acide nucleique qui est une molecule d'ADN de Don- 
neur d/lnsert, et ledit compartiment supplementaire contenant ladite proteine de recombinaison. 

35. Precede in vitro pour apposer un signal d'expression et un gene ou un g6ne partiel comprenant : 

(a) le melange d'une premiere molecule d'acide nucleique comprenant ledit signal d'expression et au moins 
un premier site de recombinaison et d'une seconde molecule d'acide nucleique comprenant ledit gene ou 
gene partiel et au moins un deuxi^me site de recombinaison ; et 

(b) I'incubatlon dudit melange en presence d'au moins une proteine de recombinaison, dans des conditions 
suffisantes pour provoquer la recombinaison d'au moins lesdits premier et deuxieme sites de recombinaison, 
en apposant ainsi ledit signal d'expression et ledit gene ou gene partiel. 

36. Precede selon la revendication 35, dans lequel ladite proteine de recombinaison est codee par un bacteriophage 
choisi dans le groupe constitu6 par le bacteriophage lambda, le phiSO, le P22, ie P2, le 186, le P4 et le PI . 

37. Proc6de selon la revendication 35, dans lequel ladite proteine de recombinaison est codee par le bacteriophage 
lambda. 

38. Precede selon I'une quelconque des revendications 35 & 37, utilisant un premier acide nucleique tel que d6fini 
dans I'une quelconque des revendications 1 8 a 32 et un second acide nucleique tel que defini dans I'une quelcon- 
que des revendications 18 a 32. 

39. W Composition comprenant une molecule d'acide nucleique telle que d^finie dans I'une quelconque des revendi- 
cations 1 8 a 32 et une premiere prot6ine de recombinaison. 

40. Composition selon la revendication 39 comprenant des molecules d'acide nucleique qui sont des ADN de Donneur 
de Vecteur et des ADN de Donneur d'Insert, et ladite premiere proteine de recombinaison. 

41. Composition selon la revendication 39 ou 40 comprenant en outre une deuxieme proteine de recombinaison, 
diff§rente de la premiere. 

42. Utilisation d'un precede selon I'une quelconque des revendications 1 & 1 7 pour changer des vecteurs, apposer 
des promoteurs a des genes, construire des genes pour des proteines de fusion, changer le nombre de copies, 
changer les replicons, doner en phages ou doner des produits d'ACP, des ADN genomiques ou des ADNc. 
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